CDS

Accession Number TCMCG042C39553
gbkey CDS
Protein Id XP_016471689.1
Location join(5211..5495,6309..6434,6514..6553,6643..6725,7324..7392,7503..7556,7797..7925,8079..9704)
Gene LOC107793768
GeneID 107793768
Organism Nicotiana tabacum

Protein

Length 803aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA319578
db_source XM_016616203.1
Definition PREDICTED: splicing factor 1 [Nicotiana tabacum]

EGGNOG-MAPPER Annotation

COG_category A
Description K homology RNA-binding domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03041        [VIEW IN KEGG]
KEGG_ko ko:K13095        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGAGCGCCAAACTTGAGCAGGCATCTAGCCGCATCCAGACGGCTGCTTCATCTGCAACATCATTGAGCAGCCCAAAGATCTCTATGTTTGCCAATAAAACCGGATTTGTGATACCAAAAAACAAGTTAGCAGGTTCATTGGTTCCGGTATATCGAGGAGGGAAAAAAGGAGGTAGTGATTCTGTTAATGAAGAGAGCAGTAAACAGGTGCAAAGGAAAACCAAGTGGGGCCCTGATCTTACGCAAGATACCACTGTCAGAAAAGCAAGAGCCTTGGCATATCAGAGTCGGGTGGATCAAATAACACAGCTACTGAGTTCTAGGACTTTGGAGGGTGAAGGCAGCAAGGACGCACTGTCAGCCTCCCCTGCTAAAGATCATGAGTCTTCGGACCATCAACCTAATGATGAGAGCGTGAGTTCACTGGAACTTGAAAGACAGGAAGCCATTGGGGAGATTCTAAAACTGAATCCCAGCTATAAGCCACCTGCTGGTTATAAGCCTGTACCCAAGGAGGCAAAGATCCCATTACCTATCAAAGAACATCCTGGGTACAATTTTATAGGTTTGATTTTTGGCCCTGCTCATAAGCAACTGGAAAAGGAAACTGGAGCTCAAGTAAAGGTTTATGGTACCAAAGCAGATACTGGAGAGAAGATAGAAGCTACTTCTGGTGAAAATGATTCTGGTGCTTATGAGGAAATGTATGTCCAAGTATCAGCTGAGACATATGAAAAGGTTGACGCTGCAGTTGCTTTAATTGAACTTCTGGTTACCCCAGCTTCAGTTACTCCAGCGTCGACTACCGCAAAATCGTCAGGTGATGGAGAAACTATTTCCGGCGAAGCAACACCAGGCCCAACGACTCCTCCTGTAGTAAATCAAGGGGTGGCTCAACCAGTTGTGGGGACGCAACCTGCTCAATTACAAGGTCATTTTCAGCCATATCAAGGACAATGGTTTCCTGGACCTACATCTCAGAATACAGTAACTCCATTTCCAGGACCCATTAACTCCTGGAGTTCTTCAGCATCCCTGGTCAGCAACCCTCACCAAGTATCTCCATCTCCTACCAACCTGTCAAATGCACCCTCACCCTTTGGCCCACCACAGGGTATGGCAGATGGATTTGGTTCAGTTCCGCGAAACCCTTTTGTTAACTCTAGCCCACAGGCACCACCACTAATGCGGCAGCCTTTCATGCCTTCCACTCATCTCGGACAAATTGGTGGACCTAGACATCTGATACCATCTTTAGGGTCTACACCACCTCAATCCAATATGACTCCACCTCAATTTTCTCAGAGCCAACCAAATCCAACAGGGTTTCCACAGGGGTTGAGATCTGTTATGTCTTCAATGCCTCAGTCTATCCCTCCTATGGCATACCCAGACCGACCATTGACCCCAGGCGGGAGCTCTCCTGGATGGTCACAGTCACCATGGAACACCCAGACGGGTCAAGGAAGTCATCATCTTTCTTCACGACCTATGGGCATCTCTACCGCTCCACATTTAGATGTTTCACATGGCCATAATTTAGCACCCCAGTCATCTGGACCAGCTCCATCTACGAATTCCGTTTTTCAATCTCAAACACGTATGCCGCCTCCAATGCATCCATCTTCAGGATCTAACCCTGCTCATTTCTTGAACCACCCTGTATCTAGCGGACAACAAGTCATGCATAGTCTCTCACCTAATCCAAATCATGGAAGTTCTCTTAATATTAATTCCATGAGACCTCCTTCCTCTGGAGCTCCAAAACCACTGCAGGCCAGTAGTGATTTTACATTCCAGCCTCACCATCCACAGAATCCAGCATCTCAAGTTGTTTCCAGGCCAGCTGGGCAATTTGGTTCTCAGGAATTTTCACCTCCAAACCAGATGATGCGCCCTTATCTACGACCAGCAATAGACAGTACAAATCCCCCACCTGTTAATCAAGGATTCCCAAGACCTCTATTAAGCAATCAGATTAATCAGCCAGGACCACACATGTCACCTGATTTCGCCCGAGGACCTGCTGGCCCTCTCCCTCAATTTAGGCACCCGGCATTTTCAAATCAGGGTATAGCTTCACCTACCGGCCCTCAAATGCAACCTCTGAACTTTAGGCCAGCTCCGAATCCTGTAGGTTCTTTCTCTCCTAGAGTAGGAAACCCGATGCCTCTTCAGCAAAATAATCCAACTGCCATGCTTCGACCACAAAATTTTGAGGCTCCCAACAATGTCTTCTTTCGACACAGTAGGCCTGTCTCCAGCTCCTCTGGAGCACATCAAATATATGACCCCTTCTCACCCACTTCTGTCCCTCGGCGTCCTCATCCCGGTGGTAATCCGGCAATGGTAGAAAAACAAGAGAGTGATCCTGAATATGAAGACTTGATGGCCTCTGTTGGAGTTAAATGA
Protein:  
MSAKLEQASSRIQTAASSATSLSSPKISMFANKTGFVIPKNKLAGSLVPVYRGGKKGGSDSVNEESSKQVQRKTKWGPDLTQDTTVRKARALAYQSRVDQITQLLSSRTLEGEGSKDALSASPAKDHESSDHQPNDESVSSLELERQEAIGEILKLNPSYKPPAGYKPVPKEAKIPLPIKEHPGYNFIGLIFGPAHKQLEKETGAQVKVYGTKADTGEKIEATSGENDSGAYEEMYVQVSAETYEKVDAAVALIELLVTPASVTPASTTAKSSGDGETISGEATPGPTTPPVVNQGVAQPVVGTQPAQLQGHFQPYQGQWFPGPTSQNTVTPFPGPINSWSSSASLVSNPHQVSPSPTNLSNAPSPFGPPQGMADGFGSVPRNPFVNSSPQAPPLMRQPFMPSTHLGQIGGPRHLIPSLGSTPPQSNMTPPQFSQSQPNPTGFPQGLRSVMSSMPQSIPPMAYPDRPLTPGGSSPGWSQSPWNTQTGQGSHHLSSRPMGISTAPHLDVSHGHNLAPQSSGPAPSTNSVFQSQTRMPPPMHPSSGSNPAHFLNHPVSSGQQVMHSLSPNPNHGSSLNINSMRPPSSGAPKPLQASSDFTFQPHHPQNPASQVVSRPAGQFGSQEFSPPNQMMRPYLRPAIDSTNPPPVNQGFPRPLLSNQINQPGPHMSPDFARGPAGPLPQFRHPAFSNQGIASPTGPQMQPLNFRPAPNPVGSFSPRVGNPMPLQQNNPTAMLRPQNFEAPNNVFFRHSRPVSSSSGAHQIYDPFSPTSVPRRPHPGGNPAMVEKQESDPEYEDLMASVGVK